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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: LABORATO I RE S GOEMAR S.A. 

(B) STREET: La Madeleine B.P. 55 

(C) CITY: Saint-Malo 
{ E ) COUNTRY : Franc e 

(F) POSTAL CODE (ZIP) : 3 5413 Cedex 

(G) TELEPHONE: 99 21 53 70 

(H) TELEFAX: 99 82 56 17 

(ii) TITLE OF INVENTION: Glycolyse hydrolase genes and their use for 
producing enzymes for the biodegradtion of carrageenans 

{iii) NUMBER OF SEQUENCES: 8 

<iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

<C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2085 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO \ 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join (211 . .1683, 1880. .2083) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 

AAGCTTTCCG ATTCTATCAT CGAAGTCATA GGAGTGGGTA AACAAAAAAG CATGAAACTA 60 
GCTTTTTAAA ATACAGACTT TCAATATAGG TCGCACACAA TATTAACGAA TAAATAAGCA 120 



AATCATATAC ATAATCATTG CTTTAAATAT GTTTTAATAC AGATATAAAC ATAGTATGTT 180 
TGTGTTTTTG GTATCTATCG GAGTGAAAAC ATG CGC TTA TAT TTT AGA AAG TTG 234 

Met Arg Leu Tyr Phe Arg Lys Leu 
1 5 

TGG TTA ACA AAT TTA TTT TTA GGC GGA GCA CTG GCC TCT TCA GCT GCG 2 82 

Trp Leu Thr Asn Leu Phe Leu Gly Gly Ala Leu Ala Ser Ser Ala Ala 
10 15 20 

ATA GGG GCT GTC TCC CCC AAG ACT TAT AAG GAC GCA GAT TTT TAT GTT 33 0 

lie Gly Ala Val Ser Pro Lys Thr Tyr Lys Asp Ala Asp Phe Tyr Val 
25 30 35 40 

GCC CCT ACT CAA CAA GAT GTT AAC TAT GAT TTA GTT GAT GAT TTT GGC 37 8 

Ala Pro Thr Gin Gin Asp Val Asn Tyr Asp Leu Val Asp Asp Phe Gly 
45 50 55 

□ GCT AAT GGA AAC GAC ACT AGT GAT GAC AGT AAT GCT TTA CAA AGA GCA 426 

2 Ala Asn Gly Asn Asp Thr Ser Asp Asp Ser Asn Ala Leu Gin Arg Ala 

1% 60 65 70 

\fi 

y ATT AAT GCT ATT AGT AGA AAA CCG AAT GGG GGC ACT TTA CTA ATA CCG 47 4 

j lie Asn Ala lie Ser Arg Lys Pro Asn Gly Gly Thr Leu Leu lie Pro 

A 75 80 ' 85 



AAT GGA ACT TAG CAT TTC CTC GGC ATA CAG ATG AAG TCG AAC GTA CAC 52 2 

Asn Gly Thr Tyr His Phe Leu Gly lie Gin Met Lys Ser Asn Val His 
90 95 100 

ATC CGT GTT GAG AGT GAC GTG ATA ATC AAG CCA ACG TGG AAT GGG GAT 57 0 

lie Arg Val Glu Ser Asp Val lie lie Lys Pro Thr Trp Asn Gly Asp 
105 110 115 120 

GGC AAA AAC CAC CGA CTA TTT GAA GTT GGC GTA AAC AAT ATT GTA AGA 618 
Gly Lys Asn His Arg Leu Phe Glu Val Gly Val Asn Asn lie Val Arg 
125 130 135 

t 

AAC TTC AGC TTT. CAA GGG TTA GGA AAC GGT TTT TTG GTG GAT TTT AAA < 666 
Asn Phe Ser Phe Gin Gly Leu Gly Asn Gly Phe Leu Val Asp Phe Lys 
140 145 150 

GAT TCT CGC GAC AAA AAC TTA GCT GTT TTT AAG TTA GGC GAT GTT AGA 714 
Asp Ser Arg Asp Lys Asn Leu Ala Val Phe Lys Leu Gly Asp Val Arg 
155 160 165 



AAT TAC AAA ATT TCC AAT TTT ACC ATT GAT GAT AAT AAA ACG ATA TTT 7 62 

Asn Tyr Lys lie Ser Asn Phe Thr lie Asp Asp Asn Lys Thr lie Phe 
170 175 180 

GCC TCA ATT TTA GTG GAC GTA ACA GAA CGT AAT GGG CGG TTA CAT TGG 810 
Ala Ser lie Leu Val Asp Val Thr Glu Arg Asn Gly Arg Leu His Trp 
185 190 195 200 

TCG CGT AAT GGA ATT ATC GAA AGA ATA AAA CAA AAT AAC GCT TTG TTC 858 
Ser Arg Asn Gly lie lie Glu Arg lie Lys Gin Asn Asn Ala Leu Phe 
205 210 215 

GGC TAC GGC CTT ATT CAA ACC TAT GGC GCA GAT AAT ATT TTG TTT AGG 906 
Gly Tyr Gly Leu lie Gin Thr Tyr Gly Ala Asp Asn lie Leu Phe Arg 
220 225 230 

AAC CTC CAT TCG GAA GGC GGA ATT GCG TTA CGG ATG. GAA ACT GAC AAC 954 
Asn Leu His Ser Glu Gly Gly lie Ala Leu Arg Met Glu Thr Asp Asn 
235 240 245 

TTA CTT ATG AAA AAT TAT AAG CAA GGC GGA ATA AGA AAC ATC TTT GCT 1002 
Leu Leu Met Lys Asn Tyr Lys Gin Gly Gly lie Arg Asn lie Phe Ala 

p 250 255 260 

a GAT AAT ATC AGA TGT AGC AAA GGA CTT GCG GCG GTC ATG TTT GGC ' CCA 1050 

Asp Asn lie Arg Cys Ser Lys Gly Leu Ala Ala Val Met Phe Gly Pro 
265 270 275 280 



: ; -ft; 



CAT TTT ATG AAG AAT GGA GAT GTG CAA GTG ACC AAT GTC AGC TCA GTT 1098 
His Phe Met Lys Asn Gly Asp Val Gin Val Thr Asn Val Ser Ser Val 
285 290 295 

AGT TGC GGT TCG GCT GTA CGA AGT GAT AGT GGA TTT GTC GAA CTC TTT 1146 
Ser Cys Gly Ser Ala Val Arg Ser Asp Ser Gly Phe Val Glu Leu Phe 
300 305 310 

AGC CCG ACA GAC GAA GTA CAT ACG CGT CAA AGT TGG AAA CAA GCC GTT 1194 

t; 

Ser Pro Thr Asp Glu Val His Thr Arg Gin Ser Trp Lys Gin Ala Val *• 
315 320 325 

GAA AGT AAA TTG GGC CGA GGG TGT GCG CAA ACC CCT TAT GCT AGA GGT 1242 
Glu Ser Lys Leu Gly Arg Gly Cys Ala Gin Thr Pro Tyr Ala Arg Gly 
330 335 340 



20 



AAT GGT GGT ACA CGG TGG GCG GCT CGC GTA ACA CAA AAA GAC GCG TGT 1290 
Asn Gly Gly Thr Arg Trp Ala Ala Arg Val Thr Gin Lys Asp Ala Cys 
345 350 355 360 

TTA GAT AAA GCA AAA CTG GAA TAT GGA ATA GAG CCT GGT TCA TTT GGC 133 8 

Leu Asp Lys Ala Lys Leu Glu Tyr Gly lie Glu Pro Gly Ser Phe Gly 
365 370 375 

ACG GTT AAA GTC TTT GAT GTT ACA GCG CGT TTT GGT TAT AAC GCA GAT 13 8 6 

Thr Val Lys Val Phe Asp Val Thr Ala Arg Phe Gly Tyr Asn Ala Asp 
380 385 390 

CTT AAA CAG GAC CAG CTA GAC TAC TTT TCT ACA TCC AAC CCT ATG TGC 143 4 

Leu Lys Gin Asp Gin Leu Asp Tyr Phe Ser Thr Ser Asn Pro Met Cys 
395 400 405 



AAG CGT GTA TGC CTT CCT ACA AAA GAA CAA TGG AGT AAG CAA GGC CAA 1482 
Lys Arg Val Cys Leu Pro Thr Lys Glu Gin Trp Ser Lys Gin Gly Gin 
410 415 420 



ATT TAC ATT GGT CCG TCA TTA GCT GCA GTA ATT GAT ACC ACA CCT GAA 153 0 

lie Tyr lie Gly Pro Ser Leu Ala Ala Val lie Asp Thr Thr Pro Glu 
425 430 435 440 

ACT TCA AAA TAC GAT TAT GAT GTG AAA ACT TTT AAC GTC AAA AGA ' ATA 1578 
Thr Ser Lys Tyr Asp Tyr Asp Val Lys Thr Phe Asn Val Lys Arg lie 
445 450 455 

AAT TTT CCT GTA AAT TCA CAC AAG ACT ATC GAC ACG AAT ACT GAA AGT 162 6 

Asn Phe Pro Val Asn Ser His Lys Thr lie Asp Thr Asn Thr Glu Ser 
460 465 470 

AGC CGT GTC TGC AAT TAT TAC GGT ATG TCC GAA TGC TCC AGC AGT CGA 1674 
Ser Arg Val Cys Asn Tyr Tyr Gly Met Ser Glu Cys Ser Ser Ser Arg 
475 480 485 

TGG GAG CGA TAGATTAAGC CGCTATATTC ATTTACTAGG TAAAAC TTC A , 1723 

Trp Glu Arg 
490 

AGCCGCATTC G AAG AAC TAT CGAACGCGGC TTTTTTGTTA AGAGCGCCTA TGACTCAGTA 1783 
TATTTTGTAT AAATATAATT TTACATCTTG TTAAAGTAAA CATCATATGT TTATATAGGT 1843 
GCAATCTAAT TTGTTAATAT AGTGTTGGAG ATAGGT ATG AAA GGT GTT TCT ACG 1897 

Met Lys Gly Val Ser Thr 
495 



AAA AAT GCT CTT TTA TTT GCA GGC TTT TCG TTA AGT CTA GTT GCA CAG 
Lys Asn Ala Leu Leu Phe Ala Gly Phe Ser Leu Ser Leu Val Ala Gin 
500 505 510 



TCA GTT AGT GCA CAA GAA GCA AAA CAG CCT GAA AAA GAA GAA AAA GAT 
Ser Val Ser Ala Gin Glu Ala Lys Gin Pro Glu Lys Glu Glu Lys Asp 
515 520 525 



GTT GAG GTG ATT TTG GTA TCG GCA CAA AAG CGT GAG CAA GCG CTT AAA 
Val Glu Val lie Leu Val Ser Ala Gin Lys Arg Glu Gin Ala Leu Lys 
530 535 540 545 

GAA GTG CCT GTA TCA ATT GAA GTT ATT CAA GGC GAC CTT CTA GA 
Glu Val Pro Val Ser lie Glu Val lie Gin Gly Asp Leu Leu 
550 555 



(2) INFORMATION FOR SEQ ID NO : 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 559 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met Arg Leu Tyr 
1 

Gly Ala Leu Ala 
20 

Tyr Lys Asp Ala 
35 

Tyr Asp Leu Val 
50 

Asp Ser Asn Ala 
65 

Asn Gly Gly Thr 

lie Gin Met Lys 
100 

lie Lys Pro Thr 
115 

Val Gly Val Asn 
130 



Phe Arg Lys Leu 
5 

Ser Ser Ala Ala 

Asp Phe Tyr Val 
40 

Asp Asp Phe Gly 
55 

Leu Gin Arg Ala 
70 

Leu Leu lie Pro 
85 

Ser Asn Val His 

Trp Asn Gly Asp 
120 

Asn lie Val Arg 
135 



Trp Leu Thr Asn 
10 

lie Gly Ala Val 
25 

Ala Pro Thr Gin 

Ala Asn Gly Asn 
60 

lie Asn Ala lie 
75 

Asn Gly Thr Tyr 
90 

lie Arg Val Glu 
105 

Gly Lys Asn His 

Asn Phe Ser Phe 
140 



Leu Phe Leu Gly 
15 

Ser Pro Lys Thr 
30 

Gin Asp Val Asn 
45 

Asp Thr Ser Asp 

Ser Arg Lys Pro 
80 

His Phe Leu Gly 
95 

Ser Asp Val lie 
110 

Arg Leu Phe Glu 
125 

Gin Gly Leu Gly 





Asn 


Gly 


Phe 


Leu 


Val 


Asp 


Phe 


Lys 


Asp 


Ser 


Arg 


Asp 


Lys 


Asn 


Leu 


Ala 




145 










150 










155 










160 




Val 


Phe 


Lys 


Leu 


Gly 
165 


Asp 


Val 


Arg 


Asn 


Tyr 
170 


Lys 


He 


Ser 


Asn 


Phe 
175 


Thr 




He 


Asp 


Asp 


Asn 
180 


Lys 


Thr 


He 


Phe 


Ala 
185 


Ser 


He 


Leu 


Val 


Asp 
190 


Val 


Thr 




Glu 


Arg 


Asn 
195 


Gly 


Arg 


Leu 


His 


Trp 
200 


Ser 


Arg 


Asn 


Gly 


He 
205 


He 


Glu 


Arg 




He 


Lys 
210 


Gin 


Asn 


Asn 


Ala 


Leu 
215 


Phe 


Gly 


Tyr 


Gly 


Leu 
220 


He 


Gin 


Thr 


Tyr 




Gly 


Ala 


Asp 


Asn 


He 


Leu 


Phe 


Arg 


Asn 


Leu 


His 


Ser 


Glu 


Gly 


Gly 


He 




2*25 










230 










235 










240 




Ala 


Leu 


Arg 


Met 


Glu 
245 


Thr 


Asp 


Asn 


Leu 


Leu 
250 


Met 


Lys 


Asn 


Tyr 


Lys 
255 


Gin 




Gly 


Gly 


He 


Arg 
260 


Asn 


He 


Phe 


Ala 


Asp 
265 


Asn 


He 


Arg 


Cys 


Ser 
270 


Lys 


Gly 




Leu 


Ala 


Ala 


Val 


Met 


Phe 


Gly 


Pro 


His 


Phe 


Met 


Lys 


Asn 


Gly 


Asp 


Val 


p 






275 










280 










285 










Gin 


Val 


Thr 


Asn 


Val 


Ser 


Ser 


Val 


Ser 


Cys 


Gly 


Ser 


Ala 


Val 


Arg 


Ser 






290 










295 










300 










''i 

i : M 


Asp 


Ser 


Gly 


Phe 


Val 


Glu 


Leu 


Phe 


Ser 


Pro 


Thr 


Asp 


Glu 


Val 


His 


Thr 


305 










310 










315 










320 




Arg 


Gin 


Ser 


Trp 


Lys 


Gin 


Ala 


Val 


Glu 


Ser 


Lys 


Leu 


Gly 


Arg 


Gly 


Cys 












325 










330 










335 




ii 


Ala 


Gin 


Thr 


Pro 


Tyr 


Ala 


Arg 


Gly 


Asn 


Gly 


Gly 


Thr 


Arg 


Trp 


Ala 


Ala 










340 










345 










350 








Arg 


Val 


Thr 


Gin 


Lys 


Asp 


Ala 


Cys 


Leu 


Asp 


Lys 


Ala 


Lys 


Leu 


Glu 


Tyr 








355 










360 










365 










Gly 


He 


Glu 


Pro 


Gly 


Ser 


Phe 


Gly 


Thr 


Val 


Lys 


Val 


Phe 


Asp 


Val 


Thr 






370 










375 










380 












Ala 


Arg 


Phe 


Gly 


Tyr 


Asn 


Ala 


Asp 


Leu 


Lys 


Gin 


Asp 


Gin 


Leu 


Asp 


Tyr 




385 










390 










395 










400 




Phe 


Ser 


Thr 


Ser 


Asn 
405 


Pro 


Met 


Cys 


Lys 


Arg 
410 


Val 


Cys 


Leu 


Pro 


Thr 
415 


Lys 




Glu 


Gin 


Trp 


Ser 
420 


Lys 


Gin 


Gly 


Gin 


He 
425 


Tyr 


He 


Gly 


Pro 


Ser 
430 


Leu 


Ala 




Ala 


Val 


He 
435 


Asp 


Thr 


Thr 


Pro 


Glu 
440 


Thr 


Ser 


Lys 


Tyr 


Asp 
445 


Tyr 


Asp 


Val 




Lys 


Thr 
450 


Phe 


Asn 


Val 


Lys 


Arg 
455 


He 


Asn 


Phe 


Pro 


Val 
460 


Asn 


Ser 


His 


Lys 




Thr 


He 


Asp 


Thr 


Asn 


Thr 


Glu 


Ser 


Ser 


Arg 


Val 


Cys 


Asn 


Tyr 


Tyr 


Gly 




465 










470 










475 










480 




Met 


Ser 


Glu 


Cys 


Ser 
485 


Ser 


Ser 


Arg 


Trp 


Glu 
490 


Arg 


Met 


Lys 


Gly 


Val 
495 


Ser 




Thr 


Lys 


Asn 


Ala 


Leu 


Leu 


Phe 


Ala 


Gly 


Phe 


Ser 


Leu 


Ser 


Leu 


Val 


Ala 



500 505 510 



Gin Ser Val Ser Ala Gin Glu Ala Lys Gin Pro Glu Lys Glu Glu Lys 

515 520 525 

Asp Val Glu Val lie Leu Val Ser Ala Gin Lys Arg Glu Gin Ala Leu 

530 535 540 

Lys Glu Val Pro Val Ser lie Glu Val lie Gin Gly Asp Leu Leu 
545 550 555 



i'ti 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: join(333. .1805, 1866.. 1997) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CCCTAAAAAC TATTCTTCAT ACCCTTTGAT GTATACGTTT AAACTATAGG GAGTTAATCT 60 
GGTTTTGGTG CAATTCTAGT TTAATAAATG AAGCCTTCTT TTTTGACTTA CATTTTATTA 12 0 

ACCTCTTGAA TTCTTGGGGC TTGCTAATTA TAAAATACTT AATATCAGGT GGTTGTGTAA 180 
AAGAGGTGGA AGGGTATAGG ACCGTTACTT ATAATTGGCC CCTGTCGGAA GGGGGGTTAA 24 0 

AGGTAAAATA GTGTTTAAGT GTATTAATTA ACTTCTATAT AAGTAGGAAA ATACACTATA 3 00 

TATTGCGACA TTATTAACCT TAAATTCTTA CA ATG AAA TTA CAA TTT AAA CCT 3 53 

Met Lys Leu Gin Phe Lys Pro 
1 5 

GTT TAT TTA GCG TCA ATT GCC ATA ATG GCA ATA GGA TGC ACC AAA GAA ' 401 
Val Tyr Leu Ala Ser lie Ala lie Met Ala lie Gly Cys Thr Lys Glu ? 
10 15 20 

GTG ACG GAA AAC GAT ACC TCC GAA ATT TCG GAA GTT CCA ACT GAA TTG 449 
Val Thr Glu Asn Asp Thr Ser Glu lie Ser Glu Val Pro Thr Glu Leu 
25 30 35 

AGG GCC GCG GCT TCT TCA TTT TAT ACC CCA CCG GGT CAG AAT GTA CGG 497 
Arg Ala Ala Ala Ser Ser Phe Tyr Thr Pro Pro Gly Gin Asn Val Arg 
40 45 50 55 



GCC AAT AAA AAA AAC CTG GTC ACG GAT TAC GGT GTT AAC CAC AAT GAT 
Ala Asn Lys Lys Asn Leu Val Thr Asp Tyr Gly Val Asn His Asn Asp 
60 65 70 



CAG AAC GAT GAT AGT AGC AAA TTA AAC CTG GCT ATC AAA GAT TTA TCG 
Gin Asn Asp Asp Ser Ser Lys Leu Asn Leu Ala lie Lys Asp Leu Ser 
75 80 85 



GAT ACC GGT GGT ATA CTG ACC CTT CCT AAG GGA AAG TAC TAT TTG ACC 

Asp Thr Gly Gly lie Leu Thr Leu Pro Lys Gly Lys Tyr Tyr Leu Thr 

90 95 100 

AAA ATT AGA ATG CGC TCT AAT GTA CAT CTT GAA ATA GAA AAG GGA ACG 

Lys lie Arg Met Arg Ser Asn Val His Leu Glu lie Glu Lys Gly Thr 

105 110 115 

GTA ATC TAT CCG ACC AAG GGG TTG ACT CCT GCG AAG AAT CAC AGA ATT 

Val lie Tyr Pro Thr Lys Gly Leu Thr Pro Ala Lys Asn His Arg lie 

120 125 130 135 



TTT GAT TTT GCC AGT AAA ACA GAG GAA AAA ATA GAA AAC GCC AGT ATA 
Phe Asp Phe Ala Ser Lys Thr Glu Glu Lys lie Glu Asn Ala Ser lie 
140 145 . 150 

GTG GGT AAA GGA GGT AAG TTT ATA GTA GAC CTA AGA GGC AAC AGT : TCT 
Val Gly Lys Gly Gly Lys Phe lie Val Asp Leu Arg Gly Asn Ser Ser 
155 160 165 

AAA AAC CAA ATT GTA GCC GAT GTT GGT AAC GTA ACC AAC TTT AAA ATA 
Lys Asn Gin lie Val Ala Asp Val Gly Asn Val Thr Asn Phe Lys lie 
170 175 180 

TCG AAT TTT ACG ATC AAG GAT GAA AAA ACC ATC TTT GCT TCG ATA TTG 
Ser Asn Phe Thr lie Lys Asp Glu Lys Thr lie Phe Ala Ser lie Leu 
185 190 195 



GTA AGC TTT ACG GAT AAG GCA GGC AAT GCT TGG CCA CAT AAA GGT ATT 
Val Ser Phe Thr Asp Lys Ala Gly Asn Ala Trp Pro His Lys Gly lie 
200 205 210 215 



ATT GAG AAT ATA GAC CAG GCG AAT GCC CAT ACG GGA TAT GGC CTC ATA 
lie Glu Asn lie Asp Gin Ala Asn Ala His Thr Gly Tyr Gly Leu lie 
220 225 230 



CAG GCG TAC GCG GCA GAT AAC ATT CTG TTC AAC AAT CTA AGT TGT ACG 1073 
Gin Ala Tyr Ala Ala Asp Asn lie Leu Phe Asn Asn Leu Ser Cys Thr 
235 240 245 

GGC GGG GTA ACC TTG CGT TTA GAA ACC GAC AAC CTC GCT ATG AAA ACC 1121 
Gly Gly Val Thr Leu Arg Leu Glu Thr Asp Asn Leu Ala Met Lys Thr 
250 255 260 

GCT AAA AAA GGG GGG GTA AGG GAT ATT TTT GCC ACA AAG ATC AAG AAT 1169 
Ala Lys Lys Gly Gly Val Arg Asp lie Phe Ala Thr Lys lie Lys Asn 
265 270 275 

ACC AAT GGC TTG ACC CCG GTA ATG TTC TCT CCC CAT TTT ATG GAA AAC 1217 
Thr Asn Gly Leu Thr Pro Val Met Phe Ser Pro His Phe Met Glu Asn 
280 285 290 295 

GGT AAA GTG ACC ATA GAT GAT GTA ACC GCC ATC GGT TGT GCA TAT GCC 12 65 

Gly Lys Val Thr lie Asp Asp Val Thr Ala lie Gly Cys Ala Tyr Ala 
300 305 310 

GTA CGT GTA GAG CAC GGT TTT ATA GAG ATT TTC GAT AAG GGG AAT AGG 1313 
Val Arg Val Glu His Gly Phe lie Glu lie Phe Asp Lys Gly Asn Arg 
315 320 325 

GCA AGT GCC GAC GCT TTC AAG AAC TAT ATT GAA GGT ATT CTA GGA GCT 13 61 

Ala Ser Ala Asp Ala Phe Lys Asn Tyr lie Glu Gly lie Leu Gly Ala 
330 335 340 

GGC TCG GTA GAA GTC GTG TAC AAA CGT AAT AAC GGA AGA ACA TGG GCG 14 09 

Gly Ser Val Glu Val Val Tyr Lys Arg Asn Asn Gly Arg- Thr Trp Ala 
345 350 355 

GCA CGT ATC GCA AAC GAC TTT AAC GAA GCG GCG TAT AAC CAC TCC AAT 1457 
Ala Arg lie Ala Asn Asp Phe Asn Glu Ala Ala Tyr Asn His Ser Asn 
360 365 370 375 

CCT GCC GTT AGC GGA ATC AAA CCA GGG AAA TTC GCC ACA TCT AAG GTA . 1505 

«. 

Pro Ala Val Ser Gly lie Lys Pro Gly Lys Phe Ala Thr Ser Lys Val <> 
380 385 390 

ACC AAT GTT AAG GCA ACC TAT AAG GGT ACT GGC GCC AAA CTC AAG CAG 1553 
Thr Asn Val Lys Ala Thr Tyr Lys Gly Thr Gly Ala Lys Leu Lys Gin 
395 400 405 



26 



GCA TTC TTA TCC TAT TTA CCC TGT TCG GAA CGT TCT AAG GTT TGT CGG 1601 
Ala Phe Leu Ser Tyr Leu Pro Cys Ser Glu Arg Ser Lys Val Cys Arg 
410 415 420 

CCA GGT CCA GAT GGG TTC GAG TAT AAC GGA CCC TCC TTG GGA GTT ACC 1649 
Pro Gly Pro Asp Gly Phe Glu Tyr Asn Gly Pro Ser Leu Gly Val Thr 
425 430 435 

ATC GAT AAC ACG AAA AGG GAC AAC AGC CTT GGC AAT TAT AAC GTC AAT 16 97 

lie Asp Asn Thr Lys Arg Asp Asn Ser Leu Gly Asn Tyr Asn Val Asn 
440 445 450 455 

GTA AGC ACC TCC AGT GTT CAG GGC TTT CCC AAT AAT TAC GTT TTA AAC 1745 
Val Ser Thr Ser Ser Val Gin Gly Phe Pro Asn Asn Tyr Val Leu Asn 
460 465 470 

GTA AAG TAT AAT ACC CCT AAA GTA TGT AAC CAA AAT CTA GGT AGT ATT 17 93 

Val Lys Tyr Asn Thr Pro Lys Val Cys Asn Gin Asn Leu Gly Ser lie 
475 480 485 



m ACT TCG TGT AAC TGATCACGAA ACAATTTGTA AATAAAAAGC AGCTGTCCCT 1845 
;;Q Thr Ser Cys Asn 

fl| 490 

TATTACGGGC GGCTGCTTTT ATG TCT TTA AGC CAT GTC GTG ATT TAT TGG 18 9 5 

f Met Ser Leu Ser His Val Val lie Tyr Trp 

495 500 

CGA CTT TTG ATA AAG GCT TGG ATT TCT TCC GGG GTA AAT ATC GGA TTG 19 43 

Arg Leu Leu lie Lys Ala Trp lie Ser Ser Gly Val Asn lie Gly Leu 
il 505 510 515 

GCC CCT TCC CTA CCG GCT ACC ATA GCT CTA TGC TCC TAT GCA CAG GCG 1991 
Ala Pro Ser Leu Pro Ala Thr lie Ala Leu Cys Ser Tyr Ala Gin Ala 
520 525 530 

AAA TCT 1997 

c 

Lys Ser >■ 
535 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 535 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Lys Leu Gin Phe Lys Pro Val Tyr Leu Ala Ser lie Ala lie Met 

15 10 15 

Ala lie Gly Cys Thr Lys Glu Val Thr Glu Asn Asp Thr Ser Glu lie 

20 25 30 

Ser Glu Val Pro Thr Glu Leu Arg Ala Ala Ala Ser Ser Phe Tyr Thr 

35 40 45 

Pro Pro Gly Gin Asn Val Arg Ala Asn Lys Lys Asn Leu Val Thr Asp 

50 55 60 

Tyr Gly Val Asn His Asn Asp Gin Asn Asp Asp Ser Ser Lys Leu Asn 
65 70 75 80 

Leu Ala lie Lys Asp Leu Ser Asp Thr Gly Gly lie Leu Thr Leu Pro 

85 90 95 

Lys Gly Lys Tyr Tyr Leu Thr Lys lie Arg Met Arg Ser Asn Val His 

100 105 110 

Leu Glu lie Glu Lys Gly Thr Val lie Tyr Pro Thr Lys Gly Leu Thr 

115 120 125 

Pro Ala Lys Asn His Arg lie Phe Asp Phe Ala Ser Lys Thr Glu Glu 

130 135 140 

Lys lie Glu Asn Ala Ser lie Val Gly Lys Gly Gly Lys Phe lie Val 
145 150 155 160 

Asp Leu Arg Gly Asn Ser Ser Lys Asn Gin lie Val Ala Asp Val' Gly 

165 170 175 

Asn Val Thr Asn Phe Lys lie Ser Asn Phe Thr lie Lys Asp Glu Lys 

180 185 190 

Thr lie Phe Ala Ser lie Leu Val Ser Phe Thr Asp Lys Ala Gly Asn 

195 200 205 

Ala Trp Pro His Lys Gly He He Glu Asn He Asp Gin Ala Asn Ala 

210 215 220 

His Thr Gly Tyr Gly Leu He Gin Ala Tyr Ala Ala Asp Asn He Leu 
225 230 235 240 

Phe Asn Asn Leu Ser Cys Thr Gly Gly Val Thr Leu Arg Leu Glu Thr 

245 250 255 

Asp Asn Leu Ala Met Lys Thr Ala Lys Lys Gly Gly Val Arg Asp He 

260 265 270 

Phe Ala Thr Lys He Lys Asn Thr Asn Gly Leu Thr Pro Val Met Phe 

275 280 285 

Ser Pro His Phe Met Glu Asn Gly Lys Val Thr He Asp Asp Val Thr 

290 295 300 

Ala He Gly Cys Ala Tyr Ala Val Arg Val Glu His Gly Phe He Glu 
305 310 315 320 
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lie Phe Asp Lys Gly Asn Arg Ala Ser Ala Asp Ala Phe Lys Asn Tyr 

325 330 335 

lie Glu Gly lie Leu Gly Ala Gly Ser Val Glu Val Val Tyr Lys Arg 

340 345 350 

Asn Asn Gly Arg Thr Trp Ala Ala Arg lie Ala Asn Asp Phe Asn Glu 

355 360 365 

Ala Ala Tyr Asn His Ser Asn Pro Ala Val Ser Gly lie Lys Pro Gly 

370 375 380 

Lys Phe Ala Thr Ser Lys Val Thr Asn Val Lys Ala Thr Tyr Lys Gly 
385 390 395 400 

Thr Gly Ala Lys Leu Lys Gin Ala Phe Leu Ser Tyr Leu Pro Cys Ser 

405 410 415 

Glu Arg Ser Lys Val Cys Arg Pro Gly Pro Asp Gly Phe Glu Tyr Asn 

420 425 430 

Gly Pro Ser Leu Gly Val Thr lie Asp Asn Thr Lys Arg Asp Asn Ser 

435 440 445 

Leu Gly Asn Tyr Asn Val Asn Val Ser Thr Ser Ser Val Gin Gly Phe 
450 455 460 

,r% Pro Asn Asn Tyr Val Leu Asn Val Lys Tyr Asn Thr Pro Lys Val Cys 
465 470 475 480 

\[Q Asn Gin Asn Leu Gly Ser lie Thr Ser Cys Asn Met Ser Leu Ser His 
i!9 485 490 495 

i'M Val Val lie Tyr Trp Arg Leu Leu lie Lys Ala Trp lie Ser Ser Gly 
^3 500 505 510 

Val Asn lie Gly Leu Ala Pro Ser Leu Pro Ala Thr lie Ala Leu Cys 
"?„ 515 520 525 

Ser Tyr Ala Gin Ala Lys Ser 
L 530 535 



(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join (1. .498, 741.. 1931, 2009.. 2179) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



GAT CAT ATC ATT CCT TTG CAA ATT AAA AAT TCT CAA GAT AGT CAA ATA 48 
Asp His lie lie Pro Leu Gin lie Lys Asn Ser Gin Asp Ser Gin lie 
1 5 10 15 

ATT AGT TTT TTT AAA GCT GAC AAA GGG AGT GTG AGC AGG CAA GTA CAC 9 6 

lie Ser Phe Phe Lys Ala Asp Lys Gly Ser Val Ser Arg Gin Val His 
20 25 30 

CCA CCT TGG CCT GTG CCT TGT AAA AGT AAA CTG CAA GAG CAA GAT AGT 144 
Pro Pro Trp Pro Val Pro Cys Lys Ser Lys Leu Gin Glu Gin Asp Ser 
35 40 45 

AGT GAG TCT AAA GAG AGT AAG GCA GAG CAA GTT AAA ATT AAC AAC TGC 192 
Ser Glu Ser Lys Glu Ser Lys Ala Glu Gin Val Lys He Asn Asn Cys 
50 55 60 

GTT GTA CAG AAC GCA ATG CTG TAC ATA GAA AAC AAT TAT TTC AAC GAT 240 
Val Val Gin Asn Ala Met Leu Tyr He Glu Asn Asn Tyr Phe Asn Asp 
65 70 75 80 

;;S ATA AAT ATA GAC ACG GTT GCT TTT TCT GTT GGC GTA AGT CGC TCT TAT 2 88 

lid He Asn He Asp Thr Val Ala Phe Ser Val Gly Val Ser Arg Ser Tyr 

111 85 90 95 

CTC GTT AAA CAA TTT AAG TTA GCA ACG AAT AAA ACG ATT AAT AAT AGA 33 6 

Leu Val Lys Gin Phe Lys Leu Ala Thr Asn Lys Thr He Asn Asn Arg 
100 105 110 

U 

\n ATC ATA GAA GTA AGA ATA GAG CAG GCT AAA AAA GTA TTA CTA AAA AAA 3 84 

f5 He He Glu Val Arg He Glu Gin Ala Lys Lys Val Leu Leu Lys Lys 
\1 115 120 125 

TCT GTT ACA GAA ACA GCT TAT GAA GTT GGT TTT AAT AAC TCA AAC TAC 43 2 

Ser Val Thr Glu Thr Ala Tyr Glu Val Gly Phe Asn Asn Ser Asn Tyr 
130 135 140 

TTC GCG ACA GTT TTT AAA AAA AGA ACA AAC TAC ACG CCC AAG CAA TTT | 480 
Phe Ala Thr Val Phe Lys Lys Arg Thr Asn Tyr Thr Pro Lys Gin Phe < 
145 150 155 160 

AAA CGT ACT TTT TCC AGC TAAAACTACA ACTAAATAAC GATTAAAAGC 528 
Lys Arg Thr Phe Ser Ser 
165 

CATTTTTAGA GAACAGTAAA ACCATTTTTT GAGGTTTGGT GTTGTATATA AATATTAAAT 5 88 



ATCCCCACTC GCTCAGCTTT TTTTGTGCGA GTTGTGAGAA TTAGCTTAAC AGGTAAGGTT 648 
TACGTATCTG TATATCTAAA CTCTTCGAAT ATAACACTGT ATCTGTTGCT GAGCTGTGGC 7 08 

TCAGTTCACA CTAACAAAGG ATGGATAAAT AA ATG AAA CCT ATA AGT ATT GTG 761 

Met Lys Pro lie Ser lie Val 
170 

GCA TTC CCT ATA CCA GCT ATA AGT ATG CTT CTT TTA AGT GCA GTA TCA 809 
Ala Phe Pro lie Pro Ala lie Ser Met Leu Leu Leu Ser Ala Val Ser 
175 180 185 

CAA GCA GCA TCT ATG CAA CCT CCC ATC GCA AAA CCT GGT GAA ACA TGG 857 
Gin Ala Ala Ser Met Gin Pro Pro lie Ala Lys Pro Gly Glu Thr Trp 
190 195 200 205 

ATT TTA CAA GCC AAA CGC TCT GAC GAA TTT AAC GTA AAA GAT GCG ACA 905 
lie Leu Gin Ala Lys Arg Ser Asp Glu Phe Asn Val Lys Asp Ala Thr 
210 215 220 

AAG TGG AAC TTT CAA ACA GAA AAC TAT GGG GTA TGG TCT TGG AAA AAT 953 
^5 Lys Trp Asn Phe Gin Thr Glu Asn Tyr Gly Val Trp Ser Trp Lys Asn 

i;g 225 230 235 

I'M GAA AAT GCG ACA GTA TCT AAT GGC AAA CTA AAA TTA ACC ACT AAG CGA 1001 

Glu Asn Ala Thr Val Ser Asn Gly Lys Leu Lys Leu Thr Thr Lys Arg 
j * 240 245 250 

j.^ GAA TCT CAT CAA CGT ACA TTC TGG GAT GGC TGT AAT CAG CAG CAA GTT 1049 
Ljl Glu Ser His Gin Arg Thr Phe Trp Asp Gly Cys Asn Gin Gin Gin Val 
[n 255 260 265 

set. 

1^ GCA AAT TAC CCA CTT TAT TAT ACA TCG GGT GTC GCT AAA TCC AGA GCT 1097 

Ala Asn Tyr Pro Leu Tyr Tyr Thr Ser Gly Val Ala Lys Ser Arg Ala 
270 275 280 285 

ACA GGT AAT TAT GGC TAT TAC GAA GCT CGA ATC AAA GGA GCG AGT ACA 1145 
Thr Gly Asn Tyr Gly Tyr Tyr Glu Ala Arg lie Lys Gly Ala Ser Thr 
290 295 300 ■ 

TTT CCT GGC GTA TCG CCT GCT TTT TGG ATG TAT AGC ACC ATT GAC CGT 1193 
Phe Pro Gly Val Ser Pro Ala Phe Trp Met Tyr Ser Thr lie Asp Arg 
305 310 315 

TCA TTA ACG AAA GAA GGG GAT GTC CAA TAT AGC GAA ATA GAC GTA GTG ' 1241 
Ser Leu Thr Lys Glu Gly Asp Val Gin Tyr Ser Glu lie Asp Val Val 
320 325 330 
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GAA CTT ACT CAA AAA AGT GCA GTG AGA GAG TCT GAT CAT GAC TTA CAC 
Glu Leu Thr Gin Lys Ser Ala Val Arg Glu Ser Asp His Asp Leu His 
335 340 345 

AAT ATT GTA GTA AAA AAT GGA AAA CCA ACA TGG ATG CGT CCA GGG TCT 
Asn lie Val Val Lys Asn Gly Lys Pro Thr Trp Met Arg Pro Gly Ser 
350 355 360 365 

TTT CCG CAG ACA AAT CAT AAC GGA TAC CAT CTA CCT TTC GAT CCT CGA 
Phe Pro Gin Thr Asn His Asn Gly Tyr His Leu Pro Phe Asp Pro Arg 
370 375 380 

AAT GAC TTT CAC ACC TAT GGT GTC AAT GTA ACT AAA GAC AAG ATC ACT 
Asn Asp Phe His Thr Tyr Gly Val Asn Val Thr Lys Asp Lys lie Thr 
385 390 395 

t ■ 

TGG TAC GTA GAT GGT GAA ATT GTG GGC GAA AAG GAT AAC TTA TAC TGG 
Trp Tyr Val Asp Gly Glu lie Val Gly Glu Lys Asp Asn Leu Tyr Trp 
400 405 410 

CAT CGT CAA ATG AAT CTC ACA TTA TCA CAA GGC TTA CGC GCG CCG CAT 
His Arg Gin Met Asn Leu Thr Leu Ser Gin Gly Leu Arg Ala Pro His 
415 420 425 

ACA CAA TGG AAA TGT AAT CAA TTT TAC CCA TCA GCG AAT AAA TCA : GCA 
Thr Gin Trp Lys Cys Asn Gin Phe Tyr Pro Ser Ala Asn Lys Ser Ala 
430 435 440 445 

GAA GGC TTC CCA ACA TCA ATG GAA GTT GAT TAT GTA AGA ACG TGG GTA 
Glu Gly Phe Pro Thr Ser Met Glu Val Asp Tyr Val Arg Thr Trp Val 
450 455 460 

AAG GTG GGC AAT AAC AAC TCT GCT CCA GGC GAG GGG CAG TCA TGT CCT 
Lys Val Gly Asn Asn Asn Ser Ala Pro Gly Glu Gly Gin Ser Cys Pro 
465 470 475 

AAC ACG TTT GTA GCT GTC AAT AGT GTT CAA CTA AGC GCA GCA AAA CAA 
Asn Thr Phe Val Ala Val Asn Ser Val Gin Leu Ser Ala Ala Lys Gin 
480 485 490 



ACA CTT CGA AAG GGC CAA TCT ACA ACG CTA GAA AGC ACA GTT CTT CCA 
Thr Leu Arg Lys Gly Gin Ser Thr Thr Leu Glu Ser Thr Val Leu Pro 
495 500 505 



AAC TGT GCA ACC AAC AAG AAA GTC ATT TAT TCA TCA AGC AAT AAA AAT 
Asn Cys Ala Thr Asn Lys Lys Val lie Tyr Ser Ser Ser Asn Lys Asn 
510 515 520 525 

GTG GCA ACT GTG AAC AGT GCT GGC GTT GTA AAA GCT AAA AAT AAA GGC 
Val Ala Thr Val Asn Ser Ala Gly Val Val Lys Ala Lys Asn Lys Gly 
530 535 540 

ACT GCG ACG ATT ACG GTT AAA ACT AAA AAC AAA GGG AAA ATA GAT AAA 
Thr Ala Thr lie Thr Val Lys Thr Lys Asn Lys Gly Lys lie Asp Lys 
545 550 555 

TTA ACC ATT GCG GTG AAT TAAGCTAACT CAAACTAGCC TCGAAGGATT 
Leu Thr lie Ala Val Asn 
560 

GAGGCACTTT ATTTATAGGT CTCAGGCTTC GACTTTTTGG AGGGGGT ATG AAA AAG 

Met Lys Lys 
565 

GTA AAT TTA TCC AGC AAG TGG ATA ATT AGC ATT AGT TTA CTA ATC ATT 
Val Asn Leu Ser Ser Lys Trp lie lie Ser lie Ser Leu Leu lie lie 
570 575 580 

TGT GAT TAT GTT TAT TTA ATA CGA ACA AAC GTT AAC GAG CAA GCT AAC 
Cys Asp Tyr Val Tyr Leu lie Arg Thr Asn Val Asn Glu Gin Ala Asn 
585 590 595 

GCA GAA GCT ACT GCA CAT ATG CAT TAC AAA ATA AAT AAT ACG AAA CAC 
Ala Glu Ala Thr Ala His Met His Tyr Lys lie Asn Asn Thr Lys His 
600 605 610 

TCA AAA GGA AAG CTT GAT C 
Ser Lys Gly Lys Leu Asp 
615 620 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Asp His lie lie Pro Leu Gin lie Lys Asn Ser Gin Asp Ser Gin lie 

15 10 15 

lie Ser Phe Phe Lys Ala Asp Lys Gly Ser Val Ser Arg Gin Val His 

20 25 30 

Pro Pro Trp Pro Val Pro Cys Lys Ser Lys Leu Gin Glu Gin Asp Ser 

35 40 45 

Ser Glu Ser Lys Glu Ser Lys Ala Glu Gin Val Lys lie Asn Asn Cys 

50 55 60 

Val Val Gin Asn Ala Met Leu Tyr lie Glu Asn Asn Tyr Phe Asn Asp 
65 70 75 80 

lie Asn lie Asp Thr Val Ala Phe Ser Val Gly Val Ser Arg Ser Tyr 

85 90 95 

Leu Val Lys Gin Phe Lys Leu Ala Thr Asn Lys Thr lie Asn Asn Arg 

100 105 - 110 

lie lie Glu Val Arg lie Glu Gin Ala Lys Lys Val Leu Leu Lys Lys 
j?n 115 120 125 

l*% Ser Val Thr Glu Thr Ala Tyr Glu Val Gly Phe Asn Asn Ser Asn Tyr 
3 130 135 140 

l|0 Phe Ala Thr Val Phe Lys Lys Arg Thr Asn Tyr Thr Pro Lys Gin Phe 
CO 145 150 155 160 

i'U Lys Arg Thr Phe Ser Ser Met Lys Pro lie Ser lie Val Ala Phe Pro 
Q 165 170 175 

lie Pro Ala lie Ser Met Leu Leu Leu Ser Ala Val Ser Gin Ala' Ala 

L. 180 185 190 

U Ser Met Gin Pro Pro lie Ala Lys Pro Gly Glu Thr Trp lie Leu Gin 

195 200 205 

i : Q Ala Lys Arg Ser Asp Glu Phe Asn Val Lys Asp Ala Thr Lys Trp Asn 
□ 210 215 220 

I Phe Gin Thr Glu Asn Tyr Gly Val Trp Ser Trp Lys Asn Glu Asn Ala 
225 230 235 240 

Thr Val Ser Asn Gly Lys Leu Lys Leu Thr Thr Lys Arg Glu Ser His 

245 250 255 

Gin Arg Thr Phe Trp Asp Gly Cys Asn Gin Gin Gin Val Ala Asn Tyr 

260 265 270 

Pro Leu Tyr Tyr Thr Ser Gly Val Ala Lys Ser Arg Ala Thr Gly Asn 

275 280 285 ¥ 

Tyr Gly Tyr Tyr Glu Ala Arg lie Lys Gly Ala Ser Thr Phe Pro Gly 

290 295 . 300 

Val Ser Pro Ala Phe Trp Met Tyr Ser Thr lie Asp Arg Ser Leu Thr 
305 310 315 320 

Lys Glu Gly Asp Val Gin Tyr Ser Glu lie Asp Val Val Glu Leu Thr 

325 330 335 

Gin Lys Ser Ala Val Arg Glu Ser Asp His Asp Leu His Asn He Val 
340 345 350 



Val Lys Asn Gly Lys Pro Thr Trp Met Arg Pro Gly Ser Phe Pro Gin 

355 360 365 

Thr Asn His Asn Gly Tyr His Leu Pro Phe Asp Pro Arg Asn Asp Phe 

370 375 380 

His Thr Tyr Gly Val Asn Val Thr Lys Asp Lys lie Thr Trp Tyr Val 
385 390 395 400 

Asp Gly Glu lie Val Gly Glu Lys Asp Asn Leu Tyr Trp His Arg Gin 

405 410 415 

Met Asn Leu Thr Leu Ser Gin Gly Leu Arg Ala Pro His Thr Gin Trp 

420 425 430 

Lys Cys Asn Gin Phe Tyr Pro Ser Ala Asn Lys Ser Ala Glu Gly Phe 

435 440 445 

Pro Thr Ser Met Glu Val Asp Tyr Val Arg Thr Trp Val Lys Val Gly 

450 455 460 

Asn Asn Asn Ser Ala Pro Gly Glu Gly Gin Ser Cys Pro Asn Thr Phe 
465 ■ 470 475 480 

Val Ala Val Asn Ser Val Gin Leu Ser Ala Ala Lys Gin Thr Leu Arg 

485 490 495 

Lys Gly Gin Ser Thr Thr Leu Glu Ser Thr Val Leu Pro Asn Cys Ala 

500 505 510 

Thr Asn Lys Lys Val lie Tyr Ser Ser Ser Asn Lys Asn Val Ala Thr 

515 520 525 

Val Asn Ser Ala Gly Val Val Lys Ala Lys Asn Lys Gly Thr Ala Thr 

530 535 540 

lie Thr Val Lys Thr Lys Asn Lys Gly Lys lie Asp Lys Leu Thr lie 
545 550 555 560 

Ala Val Asn Met Lys Lys Val Asn Leu Ser Ser Lys Trp lie lie Ser 

565 570 575 

lie Ser Leu Leu lie lie Cys Asp Tyr Val Tyr Leu lie Arg Thr Asn 

580 585 590 

Val Asn Glu Gin Ala Asn Ala Glu Ala Thr Ala His Met His Tyr Lys 

595 600 605 

lie Asn Asn Thr Lys His Ser Lys Gly Lys Leu Asp 
610 615 620 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
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(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 875. .2509 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GCCTCCGTAT TCGACAATGT TGTACGATGC TTGGCGATTC GGACTCTGTT TAAGCACTCG 60 

ATTTCGTAAA GGCACTATCC ACTCATTCAT TCCGACTCAA TATTCTTTTC GACAAATGCA 12 0 

ACCGGTTCCA TTGAAAAGGC CCTAAAAATA CAGCTTTCCC GCCCCCCATC GTAGAAGGTT 18 0 

CCAATATGCT TCAACCCCTT TTTCAGCCTT ACTTCAGGGG TATTACTTTC ATGCCTAGGG 240 

CCGCAAATAC ATTCGCTTGG ACCCAGTCAC C T AT AT AATT GAATACGGAA CTACCCATGG 3 00 

CTTCCTTCCC TTTGGGAACC TATGGTACAG ACTTGCCTTT TTTAAACCGG TTACTTCAGC 3 60- 

TAATTCGCCA AGCTGGTTCC TTCATAACCT TTGGCCCGAA ACACCTTGCA AGCACATAAA 42 0 

TCTTATCCAA TATTTTGCGG TCTCATGGGA CAAATCTATA ACAAACATTC AATTTTACCA 480 

AACGTTCGGT AATAAATCTA GTCAAAAACG GGGTCCGATT CATTTTAGAA GAAAGGTAAA 540 

GCCCCCAAAA GAGCGGTTTA CTTGAAGATA TGATTTATAA AACACAATAA GTGACAAAGG 600 

AAGATCATGG CTATAATTAG TTGAAAAAAC AGGGCTTACC ATGACATGGA GCTTTATTGA 660 

AAACAGATGT CCAACAAGAA TAAAGGAGGG CCGTTCGACC GCGACGTTTA AATAAAAACA 72 0 

TATTCCATAT CAAAATTTAA TTAAGGTTCT TTCCTACAGT ATTTATAAGA AATTAC T AAA 78 0 

ATT AG TT AGG ATAATACTAC AAAATGGTAA AATTGGATTA CTCAGATTGA ACCATAGCCT 840 

CTACTTTAGT CGGCTAACAA AAACAATTAT AGTA ATG AAA AAA CCA AAT TTT 892 

Met Lys Lys Pro Asn Phe 
1 5 

TAT GGC AAG ATG GGT AGA ACT GCA CTT TCA AGT CTT TTC TAC CTC TTT 940 
Tyr Gly Lys Met Gly Arg Thr Ala Leu Ser Ser Leu Phe Tyr Leu Phe 
10 15 20 

TTC CTA GGC CTT GTG TAT GGG CAA CAA CCT ACG AAG ACT TCA AAT CCG 9 88 

Phe Leu Gly Leu Val Tyr Gly Gin Gin Pro Thr Lys Thr Ser Asn Pro 
25 30 35 

AAC GAT CAG TGG ACC ATC AAA TGG AGT GCT TCG GAC GAA TTC AAC AAA 103 6 

Asn Asp Gin Trp Thr lie Lys Trp Ser Ala Ser Asp Glu Phe Asn Lys 
40 45 50 

AAT GAC CCC GAC TGG GCA AAA TGG ATC AAG ACA GGA AAC CTT CCG AAT 1084 
Asn Asp Pro Asp Trp Ala Lys Trp lie Lys Thr Gly Asn Leu Pro Asn 
55 60 65 70 

ACA TCG GCA TGG AAA TGG AAC AAT CAA AAA AAC GTA AAG ATT TCC AAC 1132 
Thr Ser Ala Trp Lys Trp Asn Asn Gin Lys Asn Val Lys lie Ser Asn 
75 80 85 



GGA ATT GCG GAA CTA ACG ATG AGG CAT AAC GCC AAT AAT ACC CCA CCT 
Gly lie Ala Glu Leu Thr Met Arg His Asn Ala Asn Asn Thr Pro Pro 
90 95 100 

GAC GGA GGA ACC TAT TTC ACC TCT GGG ATA TTT AAG TCG TAC CAA AAA 
Asp Gly Gly Thr Tyr Phe Thr Ser Gly lie Phe Lys Ser Tyr Gin Lys 
105 110 115 

TTT ACG TAT GGA TAC TTT GAG GCC AAA ATC CAA GGA GCG GAT ATA GGT 
Phe Thr Tyr Gly Tyr Phe Glu Ala Lys lie Gin Gly Ala Asp lie Gly 
120 125 130 

GAA GGC GTA TGC CCA TCG TTT TGG CTT TAT AGT GAT TTC GAC TAT TCC 
Glu Gly Val Cys Pro Ser Phe Trp Leu Tyr Ser Asp Phe Asp Tyr Ser 
135 140 145 150 

J 

GTA GCC AAT GGG GAA ACG GTA TAC AGT GAA ATA GAT GTA GTT GAA CTA 
Val Ala Asn Gly Glu Thr Val Tyr Ser Glu lie Asp Val Val Glu Leu 
155 160 165 

CAA CAA TTC GAT TGG TAT GAA GGC CAT CAG GAC GAC ATT TAC GAC ATG 
Gin Gin Phe Asp Trp Tyr Glu Gly His Gin Asp Asp lie Tyr Asp Met 
170 175 . 180 

GAC TTA AAT CTA CAC GCC GTT GTC AAA GAA AAC GGA CAG GGG GTT TGG 
Asp Leu Asn Leu His Ala Val Val Lys Glu Asn Gly Gin Gly Val Trp 
185 190 195 

AAA AGG CCA AAA ATG TAC CCT CAA GAA CAG TTG AAC AAA TGG AGA GCC 
Lys Arg Pro Lys Met Tyr Pro Gin Glu Gin Leu Asn Lys Trp Arg Ala 
200 205 210 

ATG GAC CCG AGT AAA GAC TTT CAT ATC TAT GGT TGT GAA GTG AAC CAG 
Met Asp Pro Ser Lys Asp Phe His lie Tyr Gly Cys Glu Val Asn Gin 
215 - 220 225 230 

AAC GAA ATC ATA TGG TAT GTT GAC GGT GTC GAG GTT GCC CGA AAA CCA 
Asn Glu He He Trp Tyr Val Asp Gly Val Glu Val Ala Arg Lys Pro 
235 240 245 



AAT AAA TAT TGG CAT CGC CCC ATG AAC GTT ACC CTT TCA TTG GGA CTC 
Asn Lys Tyr Trp His Arg Pro Met Asn Val Thr Leu Ser Leu Gly Leu 
250 255 260 



AGA AAA CCA TTT GTG AAA TTT TTC GAC AAT AAG AAC AAT GCC ATA AAT 1708 
Arg Lys Pro Phe Val Lys Phe Phe Asp Asn Lys Asn Asn Ala lie Asn 
265 270 275 

CCA GAA ACC GAT GCC AAG GCA AGG GAA AAA TTA TCG GAT ATA CCT ACA 17 56 

Pro Glu Thr Asp Ala Lys Ala Arg Glu Lys Leu Ser Asp lie Pro Thr 
280 285 290 

TCG ATG TAT GTG GAT TAC GTT CGG GTC TGG GAA AAA TCA GCA GGT AAC 1804 
Ser Met Tyr Val Asp Tyr Val Arg Val Trp Glu Lys Ser Ala Gly Asn 
295 300 305 310 

ACT ACC AAT CCC CCA ACC AGC GAG GTC GGC ACA CTA AAA ACA AAG GGT 1852 
Thr Thr Asn Pro Pro Thr Ser Glu Val Gly Thr Leu Lys Thr Lys Gly 
315 320 325 

TCG AAA CTG GTG ATT GAC CAT TGG GAT GCA AGT ACA GGG ACT ATT TCG 1900 
Ser Lys Leu Val lie Asp His Trp Asp Ala Ser Thr Gly Thr lie Ser 
iS 330 335 340 

: by 

CO GCT GTC AGT AAC AAT ACA AAG ACA GGT CAA TAT GCC GGT TCA GTG AAC 1948 

Ala Val Ser Asn Asn Thr Lys Thr Gly Gin Tyr Ala Gly Ser Val Asn 
^ 345 350 355 

f ii 

AAC GCG AGC ATC GCC CAG ATA GTA ACA TTA AAA GCG AAT ACT TCA TAT 1996 

,x Asn Ala Ser lie Ala Gin lie Val Thr Leu Lys Ala Asn Thr Ser Tyr 
^ 360 365 370 



AAG GTA TCG GCT TTC GGA AAG GCC AGC TCA CCC GGA ACA TCG GCT TAT 2044 
Lys Val Ser Ala Phe Gly Lys Ala Ser Ser Pro Gly Thr Ser Ala Tyr 
375 380 385 390 

CTA GGC ATT AGT AAA GCA TCC AAC AAC GAA CTC ATA AGC AAT TTT GAA 2 0 92 

Leu Gly lie Ser Lys Ala Ser Asn Asn Glu Leu lie Ser Asn Phe Glu 
395 400 405 

TTC AAA ACA ACC TCA TAC TCC AAA GGC GAG ATT GAG ATA AGA ACT GGA 2140 
Phe Lys Thr Thr Ser Tyr Ser Lys Gly Glu lie Glu lie Arg Thr Gly < 
410 415 420 

AAT GTT CAG GAA TCA TAT CGC ATA TGG TAT TGG TCT TCC GGG CAA GCC 2188 
Asn Val Gin Glu Ser Tyr Arg lie Trp Tyr Trp Ser Ser Gly Gin Ala 
425 430 435 
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TAT tGC GAT GAT TTT AAC CTT GTT GAA ATA AAC AGC GGG GCT TCA CAA 223 6 

Tyr Cys Asp Asp Phe Asn Leu Val Glu lie Asn Ser Gly Ala Ser Gin 
440 445 450 

CTC AAT GAA AAT GAG ACT GAA ACA GCA CTG GAA AAA GGT ATA CAC ATT 2284 
Leu Asn Glu Asn Glu Thr Glu Thr Ala Leu Glu Lys Gly lie His lie 
455 460 465 470 

TAT CCG AAT CCC TAT AAA AAC GGT CCA TTG ACA ATC GAT TTT GGC AAA 2332 
Tyr Pro Asn Pro Tyr Lys Asn Gly Pro Leu Thr lie Asp Phe Gly Lys 
475 480 485 

CCC TTC AGC GGC GAG GTC CAA ATC ACC GGT TTA AAC GGT AGA ACA TTC 23 80 

Pro Phe Ser Gly Glu Val Gin lie Thr Gly Leu Asn Gly Arg Thr Phe 
490 495 500 

TTA AGA AGA AAT GTT GTC GAT CAA ACT TCG GTT CAG CTC CTA GAA TCC 2428 
t ,e s Leu Arg Arg Asn Val Val Asp Gin Thr Ser Val Gin Leu Leu Glu Ser 
=S 505 510 515 

W AAA TCT AAA TTC AAG AGC GGT CTA TAT ATC GTT AAA ATT AGT GGC CCG 2476 
■;3 Lys Ser Lys Phe Lys Ser Gly Leu Tyr lie Val Lys lie Ser Gly Pro 
520 525 530 

S' "T; 

GAT GGA GAG GTT TCA AAA AAG ATA CTC GTG GAG TAACTAAAAA TCAATTTTTA 2529 
[A Asp Gly Glu Val Ser Lys Lys lie Leu Val Glu 
U 535 540 545 



0 CAGGATTACA GACGGGCAAA GGGATTTTCC TTTGCCCGTT TTTAAAATTA TGGGCGGAAA 2589 
CGATTGTTGC G 2600 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Lys Lys Pro Asn Phe Tyr Gly Lys Met Gly Arg Thr Ala Leu Ser 

15 10 15 

Ser Leu Phe Tyr Leu Phe Phe Leu Gly Leu Val Tyr Gly Gin Gin Pro 
20 25 30 



Thr Lys Thr Ser Asn Pro Asn Asp Gin Trp Thr lie Lys Trp Ser Ala 

35 40 45 

Ser Asp Glu Phe Asn Lys Asn Asp Pro Asp Trp Ala Lys Trp lie Lys 

50 55 60 

Thr Gly Asn Leu Pro Asn Thr Ser Ala Trp Lys Trp Asn Asn Gin Lys 
65 70 75 80 

Asn Val Lys lie Ser Asn Gly lie Ala Glu Leu Thr Met Arg His Asn 

85 90 95 

Ala Asn Asn Thr Pro Pro Asp Gly Gly Thr Tyr Phe Thr Ser Gly He 

100 105 110 

Phe Lys Ser Tyr Gin Lys Phe Thr Tyr Gly Tyr Phe 'Glu Ala Lys He 

115 120 125 

Gin Gly Ala Asp lie Gly Glu Gly Val Cys Pro Ser Phe Trp Leu Tyr 

130 135 140 

Ser Asp Phe Asp Tyr Ser Val Ala Asn Gly Glu fhr Val Tyr Ser Glu 
145 150 155 160 

lie Asp Val Val Glu Leu Gin Gin Phe Asp Trp Tyr Glu Gly His Gin 

165 170 175 

Asp Asp He Tyr Asp Met Asp Leu Asn Leu His Ala Val Val Lys Glu 

180 185 190 

Asn Gly Gin Gly Val Trp Lys Arg Pro Lys Met Tyr Pro Gin Glu Gin 

195 200 205 

Leu Asn Lys Trp Arg Ala Met Asp Pro Ser Lys Asp Phe His He Tyr 

210 215 220 

Gly Cys Glu Val Asn Gin Asn Glu He He Trp Tyr Val Asp Gly Val 
225 230 235 240 

Glu Val Ala Arg Lys Pro Asn Lys Tyr Trp His Arg Pro Met Asn Val 

245 250 255 

Thr Leu Ser Leu Gly Leu Arg Lys Pro Phe Val Lys Phe Phe Asp Asn 

260 265 270 

Lys Asn Asn Ala He Asn Pro Glu Thr Asp Ala Lys Ala Arg Glu Lys 

275 280 285 

Leu Ser Asp He Pro Thr Ser Met Tyr Val Asp Tyr Val Arg Val Trp 

290 295 300 

Glu Lys Ser Ala Gly Asn Thr Thr Asn Pro Pro Thr Ser Glu Val Gly 
305 310 315 320 

Thr Leu Lys Thr Lys Gly Ser Lys Leu Val He Asp His Trp Asp Ala 

325 330 335 

Ser Thr Gly Thr He Ser Ala Val Ser Asn Asn Thr Lys Thr Gly Gin 

340 345 350 

Tyr Ala Gly Ser Val Asn Asn Ala Ser He Ala Gin He Val Thr Leu 

355 360 365 

Lys Ala Asn Thr Ser Tyr Lys Val Ser Ala Phe Gly Lys Ala Ser Ser 

370 375 380 

Pro Gly Thr Ser Ala Tyr Leu Gly He Ser Lys Ala Ser Asn Asn Glu 
385 390 395 400 
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Leu lie Ser Asn Phe 
405 

lie Glu lie Arg Thr 
420 

Trp Ser Ser Gly Gin 
435 

Asn Ser Gly Ala Ser 
450 

Glu Lys Gly He His 
465 

Thr He Asp Phe Gly 
485 

Leu Asn Gly Arg Thr 
500 

Val Gin Leu Leu Glu 
515 

Val Lys He Ser Gly 

□ 530 
»,3 Glu 

J :3 545 



Glu Phe Lys Thr Thr 
410 

Gly Asn Val Gin Glu 
425 

Ala Tyr Cys Asp Asp 
440 

Gin Leu Asn Glu Asn 
455 

lie Tyr Pro Asn Pro 
470 

Lys Pro Phe Ser Gly 
490 

Phe Leu -Arg Arg Asn 
505 

Ser Lys Ser Lys Phe 
520 

Pro Asp Gly Glu Val 
535 



Ser Tyr Ser Lys Gly Glu 
415 

Ser Tyr Arg He Trp Tyr 
430 

Phe Asn Leu Val Glu He 
445 

Glu Thr Glu Thr Ala Leu 
460 

Tyr Lys Asn Gly Pro Leu 
475 480 
Glu Val Gin He Thr Gly 
495 

Val Val Asp Gin Thr Ser 
510 

Lys Ser Gly Leu Tyr He 
525 

Ser Lys Lys He Leu Val 
540 



